User profiling with geo-located posts and demographic data
نویسندگان
چکیده
This paper presents a novel method for user profiling in social media that makes use of geo-location information associated with social media posts to avoid the need for selfreported data. These posts are combined with two publicly available sources of demographic information to automatically create data sets in which posts are labelled with socio-economic status. The data sets are linked by identifying each user’s ‘home location’. Analysis indicates that the nature of the demographic information is an important factor in performance of this approach.
منابع مشابه
Space-Time Aware Behavioral Topic Modeling for Microblog Posts
How can we automatically identify the topics of microblog posts? This question has received substantial attention in the research community and has led to the development of different topic models, which are mathematically well-founded statistical models that enable the discovery of topics in document collections. Such models can be used for topic analyses according to the interests of user gro...
متن کاملApplying geographical clustering methods to analyze geo-located open micro-blog posts
In this paper we conduct an exploratory geographical analysis of a sample of post data from the popular micro-blogging service Twitter for the period 22nd June to 12th October 2011 in the city of Leeds. For some user accounts clear patterns of daily activity are observed, and spatiotemporal concentrations of Twitter posts (tweets) are thought likely to represent, among other things, the residen...
متن کاملScaling laws in geo-located Twitter data
We observe and report on a systematic relationship between population density and Twitter use. Number of tweets, number of users and population per unit area are related by power laws, with exponents greater than one, that are consistent with each other and across a range of spatial scales. This implies that population density can accurately predict Twitter activity. Furthermore this trend can ...
متن کاملAugmenting Input Method Language Model with user Location Type Information
Geo-tags from micro-blog posts have been shown to be useful in many data mining applications. This work seeks to find out if the location type derived from these geo-tags can benefit input methods, which attempts to predict the next word a user will input during typing. If a correlation between different location types and a change in word distribution can be found, the location type informatio...
متن کاملSocial Media Text Processing and Semantic Analysis for Smart Cities
With the rise of Social Media, people obtain and share information almost instantly on a 24/7 basis. Many research areas have tried to gain valuable insights from these large volumes of freely available user generated content. The research areas of intelligent transportation systems and smart cities are no exception. However, extracting meaningful and actionable knowledge from user generated co...
متن کامل